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In this paper, we consider two-way orthogonal frequency division multiplexing (OFDM) relay 
channels, where the direct link between the two terminal nodes is too weak to be used for data 
transmission. The widely known per-subcarrier decode-and-forward (DF) relay strategy, treats each 
subcarrier as a separate channel, and performs independent channel coding over each subcarrier. We 
show that this per-subcarrier DF relay strategy is only a sub optimal DF relay strategy, and present a 
multi-subcarrier DF relay strategy which utilizes cross-subcarrier channel coding to achieve a larger 
rate region. We then propose an optimal resource allocation algorithm to characterize the achievable 
rate region of the multi-subcarrier DF relay strategy. The computational complexity of this algorithm is 
much smaller than that of standard Lagrangian duality optimization algorithms. We further analyze the 
asymptotic performance of two-way relay strategies including the above two DF relay strategies and an 
amplify-and-forward (AF) relay strategy. The analysis shows that the multi-subcarrier DF relay strategy 
tends to achieve the capacity region of the two-way OFDM relay channels in the low signal-to-noise 
ratio (SNR) regime, while the AF relay strategy tends to achieve the multiplexing gain region of the 
two-way OFDM relay channels in the high SNR regime. Numerical results are provided to justify all 
the analytical results and the efficacy of the proposed optimal resource allocation algorithm. 
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I. Introduction 

Orthogonal frequency division multiplexing (OFDM) relaying is a cost-efficient technique 
to enhance the coverage and throughput of future wireless networks, and it has been widely 
advocated in many 4G standards, such as IEEE 802.16m and 3GPP advanced long term evolution 
(LTE-Advanced) [2], [3]. In practice, a relay node operates in a half-duplex mode to avoid strong 
self-interference. However, since the half-duplex relay node can not transmit all the time (or over 
the entire frequency band), the benefits provided by the relay node are not fully exploited [4]. 

Recently, two-way relay technique has drawn extensive attention, because of its potential to 
improve the spectrum efficiency of one-way relay strategies [4]-[13]. If one utilizes traditional 
one-way relay strategies to realize two-way communications, four phases are needed. To improve 
the four-phase strategy, the two relay-to-destination phases can be combined into one broadcast 
phase [5], [6], and the yielded three-phase strategy can support the same data rates with less 
channel resource by exploiting the side information at the terminal nodes. One can further 
combine the two source-to-relay phases into one multiple-access phase to yield a two-phase 
strategy (see Fig. 1) [7]. Hybrid strategies with more phases have been considered in [8]— [10] to 
further enlarge the achievable rate region. The diversity- multiplexing tradeoff for two-way relay 
channels was studied in [1 1]— [13]. 

Two-way relay strategies also have been in conjunction with OFDM techniques [14]-[20]. 
With amplify-and-forward (AF) relay strategy, power allocation and subcarrier permutation have 
been studied in [14], [15], and its corresponding channel estimation problem has been thoroughly 
discussed in [16]. Resource allocation for two-way communications in an OFDM cellular network 
with both AF and decode-and-forward (DF) relay strategies was studies in [17]. A graph-based 
approach was proposed to solve the combinatorial resource allocation problem in [18]. For 
practical quality of service (QoS) requirements, the proportional fairness and transmission delay 
have been considered for two-way DF OFDM relay networks in [19] and [20], respectively. 
All these studies of two-way OFDM relay channels with a DF strategy were almost centered 
on a per-subcarrier DF relay strategy, which treats each subcarrier as a separate two-way relay 
channel, and performs independent channel coding over each subcarrier. Such a per-subcarrier 
DF relay strategy is probably motivated by the fact that per-subcarrier channel coding can achieve 
the capacity of point-to-point OFDM channels. However, the story is different in OFDM relay 
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(a) The multiple-access phase. (b) The broadcast phase. 

Fig. 1. System model of a two-way OFDM relay channel, consisting of (a) a multiple-access phase and (b) a broadcast phase. 



channels: per-subcarrier channel coding can no longer attain the optimal achievable rate region 
of DF relaying for two-way OFDM relay channels. In other words, per-subcarrier DF relaying 
is merely a suboptimal DF relay strategy. More details are provided in Section III, where an 
example is provided to show that a novel DF relay strategy achieves a larger rate region. 

This paper focuses on the two-way OFDM relay channel with a negligible direct link due to 
large path attenuation or heavy blockage. This is motivated by the fact that the relay node plays 
a more important role when the direct link is weak than when it is strong [13]. The optimal 
two-way relay strategy in this case consists of two phases, which are illustrated in Fig. 1. We 
intend to answer the following questions in this paper: What is the optimal DF relay strategy 
when the direct link is negligible? Under what conditions is the optimal DF relay strategy better 
(or worse) than the AF relay strategy, and vice versa? Is the optimal DF relay strategy able to 
achieve the capacity region of two-way OFDM relay channels in some scenarios? To address 
these questions, we also investigate the capacity region bounds of the two-way OFDM relay 
channels and compare different relay strategies under the optimal resource allocation. The main 
contributions of this paper are summarized as follows: 

• We present a multi-subcarrier DF relay strategy, which has a larger achievable rate region 
than the widely studied per-subcarrier DF relay strategy. Though this multi-subcarrier DF 
relay strategy is merely a simple extension of the existing result [7], it is the optimal DF 
relay strategy for two-way OFDM relay channels. 1 To the best of our knowledge, this 

'A strategy is the optimal DF relay strategy, meaning that its achievable rate region contains the rate region of any other DF 
relay strategy. It is worth mentioning that relay strategies other than DF relay strategies may have a larger or smaller achievable 
rate region compared to this multi-subcarrier DF relay strategy in certain scenarios. 
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TABLE I 



Comparison of per-subcarrier and multi-subcarrier DF two-way relay strategies. 



Strategy 



Achievable rate region 



Resource allocation complexity 



per-subcarrier DF 



small 



low [17] 



multi-subcarrier DF 



large 



very low 



multi-subcarrier DF relay strategy has not been reported in the open literature. We develop 
an optimal resource allocation algorithm to characterize the achievable rate region of the 
multi-subcarrier DF relay strategy. We show that the optimal resource allocation solution has 
a low-dimension structure. By exploiting this structure, the complexity of both primal and 
dual optimizations can be significantly reduced. The relative benefits of our multi-subcarrier 
DF relay strategy and its resource allocation algorithm are summarized in Table I. 
• We analyze the asymptotic performance of different relay strategies in the low and high 
signal-to-noise ratio (SNR) regimes under optimal resource allocation. First, we show that 
the multi-subcarrier DF relay strategy tends to achieve the capacity region of two-way 
OFDM relay channels in the low SNR regime. Then, we characterize the multiplexing 
gain regions of the two DF relay strategies, the AF relay strategy, and the cut-set outer 
bound under optimal resource allocation. We show that the AF relay strategy can achieve 
the multiplexing gain region of two-way OFDM relay channels in the high SNR regime. 
Numerical results are provided to justify our analytical results. The asymptotic performance 
comparison of AF and DF strategies is summarized in Table II. 
The rest of this paper is organized as follows. Section II presents the system model. Section III 
presents the multi-subcarrier two-way DF relay strategy and its achievable rate region. The 
resource allocation algorithm of the multi-subcarrier DF relay strategy is developed in Section IV. 
The asymptotic performance analysis of different relay strategies is provided in Section V. Some 
numerical results are presented in Section VI. Finally, Section VII draws some conclusions. 

Notation: Throughout this paper, we use bold lowercase letters to denote column vectors, and 
also denote annxl column vector by (x\ , . . . , x n ) . K+ and R™ denote the set of nonnegative real 
numbers and the set of n x 1 column vectors with nonnegative real components, respectively. 
p y means that each component of column vector p is nonnegative. I(X; Y) denotes the 
mutual information between random variables X and Y, and I(X; Y\Z) denotes the conditional 
mutual information of X and Y given Z. E[-] denotes the statistical expectation of the argument. 
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TABLE II 



Asymptotic performance comparison of AF and DF two-way relay strategies. 



Strategy 



Low SNR 



High SNR 



multi-subcarrier DF 



achieving capacity region 



smaller multiplexing gain 



AF 



lower rate 



achieving multiplexing gain region 



II. System Model 



We consider a two-way OFDM relay channel with N subcarriers, where two terminal nodes T\ 
and T 2 exchange messages by virtue of an intermediate relay node T R . The wireless transmissions 
in the two-way DF relay channel is composed of two phases: a multiple-access phase and a 
broadcast phase, as illustrated in Fig. 1. In the multiple-access phase, the terminal nodes Ti and 
T 2 simultaneously transmit their messages to the relay node T R . In the broadcast phase, the relay 
node T R decodes its received messages, re-encodes them into a new codeword, and broadcasts 
it to the terminal nodes 7\ and T 2 . The time proportion of the multiple- access phase is denoted 
as t for < t < 1, and thereby the time proportion of the broadcast phase is 1 — t. 

In the multiple-access phase, the received signal Y Rn of the relay node T R over subcarrier n 
can be expressed as . 



where X in (iE {1,2}) is the unit-power transmitted symbol of the terminal nodes Tj over 
subcarrier n, h in is the channel coefficient from Tj to T R over subcarrier n, Pi n is the average 
transmission power, and Z Rn is the independent complex Gaussian noise with zero mean and 
variance a 2 Rn . 

In the broadcast phase, the received signals of the terminal nodes T x and T 2 over subcarrier 
n are given by 



where X Rn and p Rn denote the unit-power transmitted symbol and the average transmission 
power of the relay node T R over subcarrier n, respectively, h in denotes the associated channel 
coefficient from T R to Tj over subcarrier n, and Z in is the independent complex Gaussian noise 
with zero mean and variance af n (i E {1,2}). 

Each node is subject to an individual average power constraint, which is given by 




(1) 




(2) 



(3) 



JV 



J^Pin < Pi, i = 1,2, R, (4) 



n=l 



where Pj denotes the maximum average transmission power of node Tj. Let us use P=(Pi, P 2 , Pr) 



to represent the maximum average powers of the three nodes, and use Q = {gi n , gm, gin, §2n} N 



n=l 



2 

in 



to represent the channel state information (CSI), where g in = \h in \ 2 / a Rn and gi n = \hi n \ 2 / 'a, 
(i G {1, 2}) represent the normalized channel power gains. We assume that the perfect CSI Q is 
available at the network controller to perform resource allocation throughout the paper. 

III. Optimal Two-Way OFDM DF Relay Strategy 

This section presents a multi-subcarrier DF relay strategy, which can realize the optimal 
achievable rate region of the DF relay strategy for two-way OFDM relay channels. We also show 
that the per-subcarrier DF relay strategy considered in [17]-[20] can only achieve a suboptimal 
rate region. 

Let R12 and R 2 i denote the end-to-end data rates from T\ to T 2 and from T 2 to T\, respectively. 
When the direct link between T\ and T 2 is negligible, the optimal DF relay strategy of discrete 
memoryless two-way relay channels was given by Theorem 2 in [7]. By applying this theorem to 
two-way parallel Gaussian relay channel and considering the optimal channel input distribution, 
we can obtain the optimal achievable rate region as stated in the following lemma: 

Lemma 1 Given the maximum, transmission powers P of the three nodes and the CSI Q, the 
optimal achievable rate region of the two-way parallel Gaussian relay channel (l)-(3) with a 
DF strategy is given by: 

n DF (P,g) = { (R 12 ,R2i)eR 2 + 

N N 



{iV J V ~ 

^no g2 (i+^),^(i-oio g2 (i+^ 



g2nPRn 



t 

n=l n=l 

N N 

ginPRr. 



^tlog 2 (l + ^),^(l-t)log 2 (l+^ 

n=l ™=i 
N 

R12 + R21 < ^tlog 2 (l 



n=l n=l 
N 

| ginPln + g2nV2r, 

n=l 

N 



t 



< t < 1, J^Pin < Pi, Pin > 0, i = 1, 2, R, n = 1, . . . , N \. (5) 



n=l 
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Proof: See Appendix A. ■ 
In fact, the optimal rate region of (5) is the intersection of the capacity regions of a parallel 
multi-access channel and a parallel broadcast channel with receiver side information 2 [21]. This 
rate region can be achieved by the following multi-subcarrier DF relay strategy: In the multiple- 
access phase, the relay node decodes the messages from the two terminal nodes by either 
successive cancellation decoding with time sharing/rate- splitting, or joint decoding [22]-[24]. 
In the broadcast phase, the relay node can utilize nested lattice codes, nested and algebraic 
superposition codes to transmit the messages to the intended destinations [9], [21]. Some related 
information theoretical random coding techniques were discussed in [6], [21], [25]. In either of 
the phases, channel encoding/decoding is performed jointly across all the subcarriers. 

On the other hand, the per-subcarrier DF relay strategy independently implements the DF relay 
scheme of [7] over each subcarrier [17]-[20]. The achievable rate region of the per-subcarrier 
two-way DF relay strategy is given by 

^ p ,df(P,£) = \ {Ri2,R2i)eR\ 



N r 

R 12 < 5>in t log 2 (l , (1 - t) log 2 (l+ £? 

n=l ^ 
N f 

R 21 < £ min { t log 2 (l+ ^p) , (1 - t) log 2 (l + 9 -\ _ 

n=l ^ 

N 

R\2 + R21 < 2^ tlo g2^H - f 

n=l 

N 

< t < 1, J2p™ < Pu Pin > 0, % = 1, 2, R, 

n=l 

The only difference between TZd F (P,S) and 7Z p ,df (P,G) lies in the order of the function 
min{-} and the summation in (5) and (6), implying TZ p ,df(P,G) Q 1Zdf(P,G)- Therefore, the 
per-subcarrier DF relay strategy is only a suboptimal DF relay strategy. Similar results have been 
reported in [26], [27] for one-way parallel relay channels. 

We now provide a toy example to compare these two DF relay strategies. Consider a two-way 
OFDM relay channel with N = 2 subcarriers. The wireless channel power gains are given by 
(<7iij 9i2, 921,922) = (1, 15, 7, 3) and g in = g in for n, i G {1, 2}. The power and channel resources 



92nPRn 
T 

ginPRn 



2 Here, the receiver side information means each user's own transmitted message. 
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Fig. 2. Comparison of the achievable rate regions of different two-way OFDM relay strategies for a toy example with 

N = 2, (311,. gi2, 921,522) = (1, 15, 7,3), g in = g in , p in = 0.5, t = 0.5. 

are fixed to be pi n = 0.5 and t = 0.5. According to Lemma 1, the achievable rate region of the 
multi-subcarrier DF relay strategy with fixed resource allocation is given by the set of rate pairs 
(i?i2,-R2i) satisfying 

R 12 < min{0.5 + 2, 1.5 + 1} = 2.5 bits/s/Hz, (7a) 
#21 < min{1.5 + 1, 0.5 + 2} = 2.5 bits/s/Hz, (7b) 
R12 + R21 < 0.5 [log 2 (9) + log 2 (19)] = 3.71 bits/s/Hz. (7c) 

Similarly, by (6), the achievable rate region of the per-subcarrier DF relay strategy with fixed 
resource allocation is given by the set of rate pairs (-R12, -R21) satisfying 

R 12 < min{0.5, 1.5} + min{2, 1} = 1.5 bits/s/Hz, (8a) 
R21 < min{1.5, 0.5} + min{l, 2} = 1.5 bits/s/Hz, (8b) 
R12 + R21 < 0.5 [log 2 (9) + log 2 (19)] = 3.71 bits/s/Hz, (8c) 

where the sum-rate constraint is actually inactive. The achievable rate regions in (7) and (8) are 
shown in Fig. 2, from which one can easily observe that the considered multi-subcarrier DF relay 
strategy has a larger achievable rate region. An effective and computationally efficient approach 
for the optimal resource allocation of the proposed two-way DF strategy will be presented in 
the next section. 



Multi-subcarrier DF 

Per-subcarrier DF 



(1.21,2.5) 




,(2.5,1.21) 
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IV. Resource Allocation Algorithm 

We now develop a resource allocation algorithm to characterize the boundary of the achievable 
rate region 1Z DF (P,Q) in (5). We will show that the optimal resource allocation solution has a 
low-dimension structure, and thereby the number of dual variables to be optimized is reduced; 
see Propositions 1 and 2 below for more details. The complexity of our resource allocation 
algorithm turns out to be much lower than that of the standard Lagrangian dual optimization 
algorithm and the existing resource allocation algorithm reported in [17]. 



A. Resource Allocation Problem Formulation 

Let p E (0, oo) denote the rate ratio of the two terminal nodes, i.e., 

P = R21/R12. (9) 

Then, a boundary point (Rn,R2i) = (R12, pRu) of the achievable rate region 1Zbf(P,G) is 
attained by maximizing R± 2 within 1Z DF (P, Q) for a given rate ratio p. Therefore, the boundary 
point of TZ DF (P,Q) is characterized by the following resource allocation problem: 

max R12 (10a) 

N 



s.t. 



^12<t$>g 2 (l + ^), dOb) 

n=l 
+ N 

# 12 <-£log 2 (l+^), (10c) 



p n =l 



N 



R12 < —2^^2(1 + ~ t ), (10d) 

P n=l 

N 

R 12 <(l-t)J2^2(l+ 9 -fzf), ( 10e ) 



n=l 
N 



N 



P n=l 



^Pin < Pi, i = 1,2, R, (lOg) 



n=l 



< t < 1, (lOh) 

where p L = (pa,Pi2, ■ ■ ■ ,P%n) S M/f denotes the power allocation of node Tj for i=l,2,R. 
Problem (10) is a convex optimization problem, which can be solved by standard interior-point 
methods or by using general purpose convex solvers such as CVX [28]. However, these methods 
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quickly become computationally formidable as the number of subcarriers iV increases, because 
their complexity grows in the order of 0(N 3 - 5 ) [29], [30, p. 8 and Eq. (11.29)]. Since N can 
be quite large in practical OFDM systems, we will develop a more efficient resource allocation 
algorithm for large values of N in the sequel. 

B. Phase-Wise Decomposition of Problem (10) 

Let us first fix the value of t. Then, problem (10) can be decomposed into two power allocation 
subproblems for the multi-access phase and the broadcast phase, respectively. Note that the 
transmission powers of the terminal nodes pi and p 2 are only involved in the rate constraints 
(10b)-(10d) for the multiple-access phase, while the transmission power of the relay node p R 
is only involved in the rate constraints (lOe) and (lOf) for the broadcast phase. Let Rma and 
Rbc denote the achievable rates for the multiple-access and broadcast phases, respectively. For 
any fixed t, problem (10) can be decomposed into the following two subproblems, one for the 
multiple-access phase 

R MA (t) = max R MA (11a) 

s.t. Rma < r k (p 1 ,p 2 ), k = 1,2,3, (lib) 

N 

^Pin<Pi, 3 = 1,2, (11C) 
n=l 

and the other for the broadcast phase 

R^ c (t) 4 max i? BC (12a) 
s.t. R B c<r k (p R ), fc = 4,5, (12b) 

N 

Y,PRn<P R , (12c) 

n=l 

where the rate functions r k (pi,p 2 ), k = 1, 2, 3, and r k (p R ), k = 4, 5, are defined by 

N 

ri(pi,p 2 )=t^log 2 (l + ^), (13a) 

n=l 

N 

r.b^H-E^ + ^l (13b) 

P n=l 
, N 

I \ £ V^l I 1 , QlnPln + 92nP2n\ s 

r3{Pi,P2) = ——. t Vlog 2 1 + 7 ), (13c) 

p + 1 ^— \ v t / 

' n=l 



1 1 

N 

n{ PR ) = (i-*)£log 2 (i+f^f ), (13d) 

n=l 

TV 

r 6 (PiO = — E^( 1+ t^> < 13e ) 

P n=l 

Then, the optimal objective value of problem (10) is given by 

R\ 2 = max min{i4 A (t), R* c (t)} , (14) 

where R MA (t) and R# c (t) are defined in (11) and (12), respectively. Since problem (14) itself 
is a one-dimensional convex optimization problem, it can be efficiently solved by either golden 
section search method or the bisection method [31, Chapter 8], with R MA (t) an d -Rbc(^) at eacn 
search iteration obtained by solving (11) and (12), respectively. Next, let us show how to solve 
the subproblems (11) and (12), respectively. 

C. Lagrange Dual Optimization for Subproblem (11) 

Let us define the partial Lagrange dual function of subproblem (11) as 

D M a{\ol)= min L M a (pi, P2, Rma, A, a) , (15) 

Pl,P2^0,R M A 

where A = (Ai,A 2 ,A 3 ), a = («i,a 2 ) are nonnegative dual variables associated with three rate 
inequality constraints in (lib) and two power inequality constraints in (11c), respectively, and 

3 2 N 

Lma (Pi)P2) Rma, A, at) = —Rma + ^ ] [-Rma — r k (pi,P2)] +J2 ai (j2 Pm ~ Pi ) (16) 

fc=l i=l n=l 

is the partial Lagrangian of (11). Then, the corresponding dual problem is defined as 

max D MA (\,ct). (17) 

Since the refined Slater's condition [30, Eq. (5.27)] is satisfied in problem (11), the duality gap 
between problems (11) and (17) is zero, i.e., solving problem (17) in the dual domain will yield 
the optimal solution of the primal problem (11). 

1 ) Structure of the Optimal Dual Solution X*: Prior to the presentation of our power allocation 
algorithm for solving the problems (15) and (17), we first present an important result that the 
optimal dual solution A* satisfies the following structural property: 

Proposition 1 There exists one optimal solution (A*, a*) to the dual problem (17), where A* = 
(1-A£, 0, A3) or A* = (0, 1-A^, A3) and < A£ < 1. 
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Proof: See Appendix B. ■ 
Proposition 1 is very useful for developing our power allocation algorithm, because the search 
region for A* can be confined to a set A x |J A 2 , where Ai and A 2 are two one-dimensional dual 
sets defined by 

Ai 4 {A G R 3 + | A = (1 - A 3 , 0, A 3 ), < A 3 < 1}, (18a) 
A 2 = {A G IR+ | A = (0, 1 - A 3 , A 3 ), 0<A 3 <1}. (18b) 

In the sequel, we will show that finding solutions to both problems (15) and (17) can be 
substantially simplified by virtue of Proposition 1 . 

2) Primal Solution to Problem (15): As the first important application of Proposition 1, we 
show that the structure of A* can be exploited to simplify the primal solution to problem (15). 
For any given dual variables (A, a), the optimal power allocation solution (Pi n ,p 2n ) t0 problem 
(15) is determined by the following Karush-Kuhn-Tucker (KKT) conditions: 

3Lma = tg ln X 3 /(p + l) tg ln \ 1 ( > 0, iftf„ = 0; 

dpin ^ (t + ginPtn + 92nP 2n ) ln 2 + 9lnP\ n ) In 2 \ = Q, if p£„ > 0, 

dL MA tg 2n X 3 /{p + l) tg 2n \ 2 /p J > 0, ifp£ n = 0; 

= o; 2 \ (iyt)) 

<9p2n (t + ^InPln + ^2nP 2n ) In 2 (t + g 2n P 2n ) ln 2 1=0, if f 2n > °" 

According to Proposition 1, at least one of A£ and A 2 is 0, which can be utilized to simplify 
the KKT conditions (19). The attained optimal (Pi n ,p 2n ) is provided in the following four cases: 
Case 1: p* ln > 0, p* n > 0. If A = (1-A 3 , 0, A 3 ), then 

P * n = (a/- 1 £a 2 3 jln2-£' W 

tXs t(l-A 3 ) 

P2n (p + l)a 2 ln2 (^ ai - a2 )ln2' ^ j 

otherwise, if A = (0, 1— A3, A 3 ), then 

v * = t_h ^-^3) f21a) 

Pln (p + l)ailn2 p(ga 2 -ai)ln2' V ; 

V\n= . t{1 ~ n X3 \, (21b) 
p("2 - In 2 # 2n 

Case 1 happens if Pi n p 2n > is satisfied in (20) or (21). 

Case 2: > 0, p 2n = 0. Then the solutions to (19a) and (19b) are given by 

_ t[(p + l)A 1 + A 3 ] t 

Pln ~ ( P +l) ai ln2 g ln > (22a) 

P2„= 0. (22b) 
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This case happens if p\ n > is satisfied in (22a). 



Case 3: p\ n = 0, p\ n > 0. Then the solutions to (19a) and (19b) are given by 



P\n= 0, 

t[(p + l)A 2 + P A 3 ] t 
P2n ~ p(p + l)a 2 ln2 g 2n 



(23b) 



(23a) 



This case happens if p\ n > is satisfied in (23b). 

Case 4: p* n = 0, p\ n = 0. This is the default case when the above 3 cases do not happen. 
Remark 1 If the structural property of A* in Proposition 1 is not available, one can still obtain 
an alternative closed-form solution to (19) [1]. However, this solution involves solving a more 
difficult cubic equation, which is presented in Appendix C. Nevertheless, these two closed-form 
solutions are much simpler than the iterative power allocation procedure proposed in [17] for 
the per-subcarrier DF relay strategy. 

Remark 2 Since the Lagrangian (16) is not strictly convex with respect to the primal power 
variables at some dual points, the power allocation solution in (20)-(23) may be non-unique at 
those dual points. For example, if A 3 = 1 and a\ = the power allocation solution in either 

of (20) and (21) is indeterminate due to the presence of -jj form. In fact, any nonnegative (p* n ,P2n) 
satisfying gi n p* n + 9inV%, n = ai {p+i)in2 ~~ ^ 1S a SOiut i° n t0 (19) in this case. Nevertheless, any 
one of the optimal primal power solutions can be used to derive the subgradient for solving 
the dual problem (17) [32, Section 6.1]. After the optimal dual point (A*, a*) is obtained, we 
still need to recover a feasible solution to the primal problem (11) [32]-[34]. According to [32, 
Proposition 5.1.1], this can be accomplished by incorporating the feasibility conditions (49h) and 
the complementary slackness conditions (49i), which are contained in the KKT conditions of 
problem (11) given in Appendix B, into the power allocation solutions (20)-(23), which involves 
solving a system of linear equations and inequalities. 

3) Dual Solution to Problem {11): We now solve the dual problem (17) by a two-level 
optimization approach [17], which first fixes A and searches for the optimal solution ck*(A) to 
the maximization problem 



Gma(A) = max D MA (\, a) 



(24) 



and then optimizes A by 



A* = argmax Gma(A). 



(25) 
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The inner-level optimization problem (24) is solved by an ellipsoid method [35] summarized in 
Algorithm 1, where the subgradient of the dual problem D MA (\,ot) with respect to at is given 
by [32, Proposition 6.1.1] 

(N N \ 

XX, -Pi, XX, -P 2 , (26) 
n=l n=l / 

where G»i n ,p 2n ) * s me optimal power allocation solution obtained by (20)-(23). More details 
about the initialization of ex and the matrix A in Algorithm 1 are given in Appendix D. 

By Proposition 1, the outer-level optimization problem (25) can be solved by searching for 
A* over the set Ai [j A 2 , i.e., 

A* = arg max G MA (A). (27) 

AeAiUA 2 

In order to solve the reduced outer-level optimization problem (27), we first need the sub- 
gradient of the objective function G M a(A). According to [32] and [36, Corollary 4.5.3], one 
subgradient of G M a(A) in (24) is given by 

£(A) = (R* MA - n, - r 2 , Ru A - r 3 ) , (28) 

where -R^a = min{ri, r 2 , r 3 }, and r k (k— 1,2,3) are the rate functions (13a)-(13c) associated 
with the optimal primal power allocation solution (20)-(23) obtained at the dual point (A, a*(A)), 
respectively, and e**(A) is the optimal solution to (24). 

With the subgradient £(A) of Gma(A), we are ready to solve the outer-level optimization 
problem (27). Instead of searching both the sets Ai and A 2 , we propose a simple testing method 
to determine whether A* G Ai or A* e A 2 . Noticing that Ai f] A 2 = {(0, 0, 1)}, let us consider a 
testing method at the dual point A = (0, 0, 1). By the concavity of the dual function £> M a(A, a), 
GW(A) is also concave in A, which implies [32, Eq. (B.21)] 

Gma(A) < G MA (A°) + (A - A°f £(A°), V A e A x [j A 2 . (29) 

Suppose that A* is an optimal solution to (27), i.e., G M a(A*) > G M a(A°). Then, by (29), we 
must have 

(A* - A°) T £(A°) > (30) 

for the optimal dual point A*. In other words, if a dual point A satisfies (A — A°) T £(A°) < 0, 
then A cannot be an optimal solution to problem (27). Due to this and (28), we establish the 
following proposition: 
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Algorithm 1 The ellipsoid method for solving the inner-level problem (24) 
l: Input CSI {gin, g2n}n=i> average powers {P 1 ,P 2 }, rate ratio p, time proportion t, and A. 

2: Initialize a and a 2 x 2 positive definite matrix A that define the ellipsoid £(&, A) = 

{x G R 2 + | O - a) T A _1 (a; - a) < 1}. 

3: repeat 

4: Compute the optimal (pi n ,p 2n ) by (20)-(23) for given (A, a) and t. 
5: Compute the subgradient ?7(A, a) with respect to ex. by (26). 

6: Update the ellipsoid: (a) rj := . 1 77; (b) a := a-±Ar7; (c) A := f (A-f Am7 T A). 

7: until ck converges to a* (A). 

8: Output the optimal dual variable ck*(A) for given A. 



Proposition 2 Let (/c=l,2,3) denote the values of the terms used in the subgradient £(A) 
m (28) with X = X°. If r 3 > t\, then A* G A x . If r 3 > r 2 , then X* G A 2 . //" r 3 > r x and 
r 3 > r 2 , then A* = A = (0, 0, 1). 

Proof: See Appendix E. ■ 
The procedure for solving (27) is given as follows: First, we utilize the preceding testing 
method stated in Proposition 2 to determine whether A* G A x or A*gA 2 . Then, we use the 
bisection method to find the optimal dual variable A*. If A*=(l — A3, 0, A3) G A x , the directional 
subgradient C(^3) °f GW(A) along the direction of A 1 is determined by 

8X 

C(A 3 ) = £ (Af — = £(A) T (-1, 0, 1) = n - r 3 ; (31) 

otherwise, if A* = (0, 1 — A3, A3) G A 2 , the directional subgradient C(^3) along the direction of 
A 2 is determined by 

C(A 3 )=£(A) T (0,-l,l)=r 2 -r 3 . (32) 

Since G M a(A) is concave in A, it is also concave along the direction of A x (or A 2 ). Thus, 
C(A 3 ) is monotonically non-increasing with respect to A3. Therefore, we can use the bisection 
method to search for the optimal solution A3 to (25), which satisfies ((X£) = 0, if < A 3 < 1; 
C(Ag) < 0, if A3 = 0; or ((\ 3 ) > 0, if A 3 = 1. The obtained algorithm for solving subproblem 
(11) is summarized in Algorithm 2. 
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Algorithm 2 Proposed duality-based algorithm for solving subproblem (11) 

l: Input CSI {gin, 92n}n=v average powers {P 1 ,P 2 }, rate ratio p, and time proportion t. 

2: Check whether A* G A 1 or A* £ A 2 by Proposition 2. If A* = A = (0, 0, 1), go to Step 10; 

otherwise, find A* by the bisection method in Steps 3-9. 

3: Initialize A 3jmin = 0, A 3iinax = 1. 

4: repeat 

5: Update A 3 = |(A 3jmin + A 3iinax ). 

6: Derive a* (A) for the inner- level dual optimization problem (24) by Algorithm 1. 
7: Compute the subgradient £(A) by (28) and the subgradient C(-^3) by either (31) or (32). 
8: If the subgradient C(^3) < 0, then update A 3)max = A 3 ; else update A 3 min = A 3 . 

9: until A 3 converges. 

10: Obtain the optimal {p^p^} by (20)-(23) and Remark 2. 

11: Output the optimal power allocation solution {Pi,?^} anc ^ me optimal rate _Rj^ A (t). 



D. Lagrange Dual Optimization for Subproblem (12) 

The partial Lagrange dual function of subproblem (12) is defined as 

£>bc(A 4 , A 5 ,a 3 ) = min L BC (Pr, Rbc, A 4 , A 5 , a 3 ) , (33) 
PRhO,R B c 

where A 4 , A5 and a 3 are the nonnegative dual variables associated with two rate inequality 
constraints in (12b) and one power inequality constraint (12c), respectively, and 

5 N 

Lbc (Pr, Rbc, A 4 , A 5 , a 3 ) = —Rbc + ^ A fc [R BC - r k (p R )] + a 3 (^^PRn - Pr^J • (34) 

k=4 n=l 

Then, the corresponding dual optimization problem is defined as 

max D BC (A 4 , A 5 ,a 3 ). (35) 

A4>0,A 5 >0,a 3 >0 

The KKT conditions associated with (33) are given by 

OLbc = (l-t)hnK (1 ~ t)g ln X 5 I > 0, ifp* Rn = 0; 

dpRn 3 (1 - t + hnpRn) In 2 p(l - t + gmP Rn ) In 2 I = Q) if p * Rn > Qj 

^ = K + K-1 = 0. (36b) 
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Algorithm 3 Proposed duality-based algorithm for solving subproblem (12) 

l: Input CSI {gi n , g2n}n=i> average power P R , rate ratio p, and time proportion t. 

2: Initialize A 5jmin = 0, A 5iinax = I. 

3: repeat 

4: Update A 5 = |(A 5imin + A 5>max ) and initialize a 3imin , a 3jmax with given A 5 . 
5: repeat 

6: Update a 3 = |(a 3jmin + a 3imax ). 

7: Obtain the optimal p R by solving (37) at the dual point (1 — A 5 , A 5 , a 3 ). 

8: If Y,n=iPRn < p R> tnen update a 3 , max = a 3 ; else update a 3 , min = a 3 . 

9: until a 3 converges to a^As). 

10: Obtain the optimal p R by solving (37) at the dual point (1 — A 5 , A 5 , a 3 (A 5 )). 

11: If r A (p R ) < r 5 (p* R ), then update A 5>max = A 5 ; else update A 5imin = A 5 . 

12: until A 5 converges. 

13: Obtain the optimal p R by (37). 

14: Output the optimal power allocation solution p* R and the optimal rate R# c (t). 



If p Rn > 0, then the equality in (36a) holds, and the optimal p* Rn can be shown to be the positive 

root x of the following quadratic equation [17] 

(1 — t)q 2n \A (1 — t)o ln A 5 , „ ,„„ x 

l-t + g 2n x p(l-t + gi n x) 

If (37) has no positive root, then p Rn = 0. By (36b), we have A4 = 1 — A£. Thus, the optimal dual 

variables (a^, A5) can be derived by a two-level bisection optimization method, and the obtained 

algorithm for solving subproblem (12) is summarized in Algorithm 3. More details about the 

initialization of « 3 m i n and a 3jIIiax in Algorithm 3 are given in Appendix D. 

As previously mentioned, after solving the power allocation subproblems (11) and (12), 

problem (14) can be solved by utilizing the efficient one-dimensional search methods in [31, 

Chapter 8], thereby yielding Algorithm 4 for solving problem (10). 

E. Computational Complexity Analysis 

The computational complexity of Algorithm 2 is given by O (L(2)KNCi), where L{n) = 
O (2(n+l) 2 ln(l/e)) is the number of iterations in the ellipsoid method for an n-variable non- 
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Algorithm 4 Proposed resource allocation algorithm for solving problem (10) 
i: Input CSI {gin,g2n,gm,g2n}n=i> average powers {P U P 2 ,P R }, rate ratio p. 
2: repeat 

3: Solve the power allocation subproblems (11) and (12) by Algorithm 2 and Algorithm 3, 

respectively, where t is a given parameter. 
4: Update t using the one-dimensional search method for problem (14). 
5: until t converges. 

6: Output the optimal resource allocation {p*,p 2 ,p*ji,t*} and the optimal rate R\ 2 . 



smooth optimization problem [37, p. 155], e is the accuracy of the obtained solution, K = 
O (ln(l/e)) is the complexity (abbreviated for the complexity order) of one-dimensional search 
methods such as the bisection method, C\ is the complexity for computing the closed-form 
power allocation solution (20)-(23). The computational complexity of Algorithm 3 is given by 
O (K 2 NC 2 ), where C 2 is the complexity for computing the closed-form power allocation solution 
to (37). Therefore, the overall computational complexity of the proposed resource allocation 
algorithm (Algorithm 4) is given by O (L(2)K 2 Nd + K 3 NC 2 ). 

The complexity of the resource allocation algorithm for the per-subcarrier DF relay strategy 
in [17] is given by 0(L(2)L(3)KN(I + C 2 )), where I is the complexity of the iterative power 
allocation algorithm in Eq. (26) and (27) in [17], C 2 is the complexity of the closed-form power 
allocation solution in Eq. (28) in [17]. The complexities of the algorithm in [17] and Algorithm 4 
both grow linearly with the number of subcarriers N, and therefore they are quite appropriate for 
practically large values of N. In addition, the computational complexity of the iterative power 
allocation algorithm / is much larger than that of the closed-form power allocation solution 
C\. The computational complexity of the ellipsoid method L(3) is much larger than that of 
one-dimensional search methods K. Therefore, the computational complexity of Algorithm 4 is 
much smaller than that of the resource allocation algorithm in [17]. 

V. Asymptotic Performance Analysis 

In this section, we analyze the asymptotic rate regions of different relay strategies for two-way 
OFDM channels, including both the per-subcarrier and the proposed multi-subcarrier DF relay 
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strategies, the AF relay strategy, and the cut-set outer bound, in order to compare their achievable 
rate regions in both low and high SNR regimes and their respective performance merits. 

The cut-set outer bound for the capacity region of the two-way OFDM relay channels (l)-(3) 
is obtained by removing the sum-rate constraints in (5), which is given by [7] 

K oat (P,g) = { (R 12 ,R 21 )eR 2 + 

N 

92nPRn 



t 



{iV JV ~ 

^tlog 2 (l+^),^(l-t)log 2 (l+^ 
n=l n=l 
( N N 

R 21 < mini £ *og 2 (l + , D 1 " ^ (^TZT 

^ n=l n=l 
N >| 

< t < 1, Y,Pin < Pi, Pin > 0, i = 1, 2, R, n = 1, . . . , N \. (38) 



n=l 



The achievable rate region for the AF relay strategy is given by [17] 

N 

n AF (P,g) = { (R 12 ,R2i)e 



n=l~ - ' 92n<ln 



R12 < g2 V ~ 

n=l 

N 1 o 

R^<j:lio g ji + ^ 

^2 V l + g ln a n 

N 

^Pin < Pi, Pin > 0, i = 1, 2, R, n = 1, . . . , N \, (39) 



n=l 



where a n = -r is the amplification factor of the relay node in subcarrier n and the 

time proportion t is fixed to be 0.5 due to the incompressible nature of the AF relay strategy. 

Suppose that P = (Pi, P 2 , P R ) is a column vector constituted by nominal values of Pi, P 2 
and Pr. Then the available transmission powers of the three nodes can be expressed as 

P = xP, (40) 

where x is a positive scalar variable. Note that the average SNRs of all the wireless links are 
proportional to x, and so we will analyze the achievable rate regions of the two-way relay 
strategies under consideration for small x and large x instead. 



A. Low SNR Regime 

In the low SNR region (small x), the function log 2 (l + ax) with a > can be expressed as 
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log 2 (l + ax) = -^-x + 0(x 2 ). (41) 
In 2 

Using (41), we can show the following proposition: 

Proposition 3 For sufficiently small x>0 and any rate pair (i? 12 , R 2 \) £ 7Z. out (xP, Q), there 
exists some (Ri 2 ,R 2 i) ^T^-df(xP,Q) such that i? 12 = R 12 + 0(x 2b ) and R 21 = R21 + 0(x 2b ) 
for b > 1. The regions 7Z DF (xP, Q) and 7Z out (xP, Q) tend to be the same as x — > 0. 

Therefore, the multi-subcarrier DF relay strategy tends to achieve the capacity region of two-way 
OFDM relay channels (l)-(3) as x— >0. 

Proof: See Appendix F. ■ 
On the other hand, it can be easily shown that the achievable rate region of the AF relay 
strategy will deflate in a much faster speed than the other two-way strategies for small x, due 
to the noise amplification and propagation effects. 

B. High SNR Regime 

In the high SNR region (large x), the function log 2 (l + ax) with a > satisfies 

log 2 (l + ax) = \og 2 (ax) + 0(l/x) = \og 2 (x) + 0(1). (42) 

Let us define the multiplexing gain region of the multi-subcarrier DF relay strategy [38]: 

a ,. n m {xP,G) 

r DF = Inn — p-r — . (43) 

z->oo log 2 {X) 

Using (42), we can prove the following proposition: 

Proposition 4 The multiplexing gain region of the multi-subcarrier DF relay strategy is given 

by 

r DF = { (ru, r 2l ) r 12 + 2r 2i < N, 2r 12 + r 21 < N, r 12 , r 21 > j . (44) 

Proof: To prove (44), it is sufficient to find two rate regions TZi(xP,Q) and TZ 2 (xP,Q), 
such that 1Zi(xP,Q) C TZuf(x P,Q) C TZ 2 (xP,Q), and the corresponding multiplexing gain 
regions of 1Zi(xP,G) and 1Z 2 (xP,Q) are both given by (44). The detailed proof is given in 
Appendix G. ■ 
Actually, the multiplexing gain region r DF given by (44) depends on the time proportion allo- 
cation but not upon the power allocation, which can be observed from the proof of Proposition 4 
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in Appendix G. For instance, the simple equal power allocation scheme can achieve this mul- 
tiplexing gain region, and thereby the achievable rate region gap between this power allocation 
scheme and the optimal power allocation scheme asymptotically converges to a constant region 
gap for sufficiently large x. 

Following similar ideas for the proof of Proposition 4, one can derive the multiplexing gains 
for the per-subcarrier DF relay strategy, the AF relay strategy, and the cut-set outer bound as 
stated in the following proposition (with the proof omitted): 

Proposition 5 Let r pDF , r AF , and r out denote the multiplexing gain regions of the per-subcarrier 
DF relay strategy, the AF relay strategy and the cut-set outer bound, respectively. Then r pDF = 
r DF (given by (44); and r AF = r out = {(r 12 ,r 21 ) r 12 < f , r 21 < f , r 12 ,r 21 > oj. 

Proposition 5 implies that the AF relay strategy can achieve the multiplexing gain region of 
the two-way OFDM relay channels, while the performance of both DF relay strategies is worse 
than that of the AF relay strategy in the high SNR regime. An illustrative example for these 
analytical results is given in Fig. 3. To the best of our knowledge, the multiplexing gain region 
of the cut-set outer bound was derived in [11], while the multiplexing gain regions of the DF 
and AF relay strategies have not been reported in the open literature before. All the analytical 
results as presented in Propositions 3-5 will be confirmed by our numerical results in the next 
section. 

VI. Numerical Results 

We now provide some numerical results to compare the performance of different two-way 
OFDM relay strategies under optimal resource allocation. The wireless channels are generated 
by using M = 4 independent Rayleigh distributed time-domain taps. The number of subcarriers 
in the OFDM channel is N = 16. We assume that the wireless channels between Tj (i £ {1, 2}) 
and T R are reciprocal, i.e., g in = g in , for all z = 1, 2, n = 1, . . . , 16. The maximum average 
transmission powers for all the nodes are assumed to be the same, i.e., P\ — P 2 — Pr — P. 
Therefore, the average SNR of the wireless links between Ti (i E {1,2}) and T R is given by 
SNR* =E[<y£. 

We consider the following two-way relay strategies in our numerical comparisons: the multi- 
subcarrier DF relay strategy proposed in Lemma 1, the per-subcarrier DF relay strategy [17], 
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Fig. 3. Comparison of multiplexing gain regions of different two-way OFDM relay strategies in the high SNR regime. 



the AF relay strategy [17], and the cut-set outer bound [7]. The associated rate regions of these 
strategies are given by 1Z DF (P, Q) in (5), 7e p , DF (P, Q) in (6), TZ AF (P, Q) in (39), and TZ out (P, Q) 
in (38), respectively. The optimal resource allocation of the multi-subcarrier DF relay strategy is 
obtained by Algorithm 4, the optimal resource allocation of the per-subcarrier DF relay strategy 
and the AF relay strategy are carried out based on the power allocation algorithms proposed in 
[17], and the optimal resource allocation of the cut-set outer bound is obtained by a simpler 
version of Algorithm 4. 

Figures 4(a)-4(d) provide the rate regions of these relay strategies for four symmetric SNR 
scenarios with SNR4 = SNR 2 = SNR = 0, 10, 20, 30 dB, respectively. Some observations from 
these figures are worth mentioning: First, the achievable rate region of the multi-subcarrier 
DF relay strategy is always larger than that of the per-subcarrier DF relay strategy. Second, as 
the SNR decreases, the achievable rate region of the multi-subcarrier DF relay strategy tends to 
reach the cut-set outer bound. Third, the achievable rate region of the AF relay strategy grows 
with SNR, but it is still a subset of those of the DF relay strategies for SNR < 20 dB; this is no 
longer true for SNR = 30 dB. Finally, in the high SNR region, the rate regions of these strategies 
tend to be dominated by the multiplexing gain region, thereby consistent with Propositions 4 and 
5. To be more specific, the shape of the outer bound tends to be a rectangle depending on the 
SNR. The achievable rate region of the AF strategy is closer to the outer bound for the higher 



23 



0.8 
0.7 
0.6 

^.5 



s 0.4 
e 

cS'o.3 



0.2 
0.1 




Cut-set Outer Bound 

Multi-subcarrier DF 

— ■ — ■ Per-subcarrier DF 
AF 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 O.f 
R, 2 (bits/s/Hz) 

(a) SNR = 0dB. 



Cut-set Outer Bound 
Multi-subcarrier DF 
Per-subcarrier DF 
AF 




0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 
R, 2 (bits/s/Hz) 

(c) SNR = 20dB. 
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Fig. 4. Achievable rate regions of four two-way OFDM relay strategies for four symmetric SNR scenarios (i.e., 
SNRi = SNR 2 = SNR), including (a) SNR = 0dB, (b) SNR=10dB, (c) SNR = 20dB, and (d) SNR = 30dB. 



SNR, but that of the two DF strategies are not. However, for the low SNR, only the proposed 
multi-subcarrier DF strategy can approach the outer bound. 

Figure 5 provides the optimal channel resource allocation result t* versus the rate ratio 
P = R21/R12 of the multi-subcarrier DF strategy and the cut-set outer bound for SNR1 = SNR 2 = 
20 dB. When < p < 0.2 or p > 5, the optimal t* of the multi-subcarrier DF strategy and the 



24 



0.64 
0.62 
0.60 
0.58 
t* 0.56 
0.54 
0.52 
0.50 
0.48 

0.01 0.10 1.00 10 100 

P = R21/R12 

Fig. 5. The optimal time proportion of the multiple-access phase t* versus the rate ratio p = -R21 /-R12 of the multi-subcarrier 
DF strategy and the cut-set outer bound for SNRi = SNR 2 = 20 dB. 

cut-set outer bound are the same; when 0.2 < p < 5, the optimal t* of the multi-subcarrier DF 
strategy is larger than that of the cut-set outer bound due to the additional sum-rate constraint, 
and the maximal t* is achieved at p = 1. Figure 6 shows the achievable rate region of the multi- 
subcarrier DF relay strategy obtained by solving problem (10) using CVX, and that obtained by 
using Algorithm 4, justifying that they yield the same numerical results as expected. 

Figure 7(a) and 7(b) illustrate the rate regions of these relay strategies for two asymmetric SNR 
scenarios, including (SNRi, SNR 2 ) = (10 dB, 5 dB) and (SNRi, SNR 2 ) = (30 dB, 5 dB). Similar 
observations from Figure 4 can be seen in Figure 7 as well. 

Finally, Figure 8 shows some results (the achievable rate versus average SNR) of these 
relay strategies for the symmetric SNR symmetric rate scenario, i.e., SNRi = SNR 2 = SNR and 
R 12 = i?2i- The numerical results in Fig. 8 were obtained by averaging over 500 fading channel 
realizations. One can see from this figure that, in the low SNR regime, the multi-subcarrier 
DF relay strategy tends to have the same performance as the cut-set outer bound, and that 
the multi-subcarrier DF relay strategy performs better than the AF relay strategy in the low to 
moderate SNR regime, i.e., SNR < 24 dB. Moreover, the multi-subcarrier DF relay strategy with 
the optimal resource allocation performs better than with the equal power allocation and the 
optimal t* used; it also outperforms the per-subcarrier DF strategy. 

By Proposition 5, in the high SNR regime, the multiplexing gains of the AF relay strategy and 
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Fig. 6. Achievable rate region of the multi-subcarrier DF relay strategy obtained by using Algorithm 4 and that obtained by 
solving problem (10) using the convex solver CVX for SNRi = SNR2 = 20 dB. 

the cut-set outer bound are the same; the multiplexing gains of the two DF relay strategies are 
also the same; the multiplexing gain of the AF relay strategy is larger than that of the DF relay 
strategy (implying better performance for the former than the latter for sufficiently high SNR); 
both the equal power allocation and the optimal power allocation for the multi-subcarrier DF 
strategy achieve the same multiplexing gain, and the rate gap between them tends to a constant 
value as SNR increases. All these analytical results have been substantiated by the numerical 
results shown in Figures 4-8. 

VII. Conclusion 

We have analytically shown that the widely studied per-subcarrier DF relay strategy is only a 
suboptimal DF relay strategy for two-way OFDM relay channels in terms of achievable rate 
region as the direct link between the two terminal nodes is too weak to be used for data 
transmission. We have presented a multi-subcarrier DF relay strategy that can achieve a larger 
rate region than the per-subcarrier DF relay strategy. Although the optimal resource allocation 
for the proposed multi-subcarrier DF relay strategy can be formulated as a convex optimization 
problem which can be solved by off-the-self convex solvers, we have presented a computationally 
efficient algorithm (Algorithm 4) for obtaining the optimal resource allocation together with 
its complexity analysis. Then we have presented an analysis of asymptotic performance for 
the above two DF strategies, the AF strategy, and the cut-set outer bound, justifying that the 
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Fig. 7. Achievable rate regions of four two-way OFDM relay strategies for two asymmetric SNR scenarios, including (a) 
SNRi = 10 dB, SNR 2 = 5 dB, and (b) SNRj = 30 dB, SNR 2 = 5 dB. 

proposed multi-subcarrier DF relay strategy is suitable for the low to moderate SNR regime, 
while the AF strategy is suitable for the high SNR regime (as stated in Propositions 3 to 5). 
Some numerical results have been presented to demonstrate all the analytical results and the 
efficacy of the proposed optimal resource allocation algorithm. 

Appendix A 
Proof of Lemma 1 

According to Theorem 2 of [7], the optimal achievable rate region of discrete memoryless 

two-way relay channel with a DF relay strategy is given by the set of rate pairs (Ru, R21) 

satisfying r , 

R 12 <mm{tI(X l ;Y R \X 2 ),(l-t)I(X R ;Y 2 )}, (45a) 

R 21 < min{tI(X 2 ; Y R \X X ), (1 - t)I(X R ; Y 1 )} , (45b) 

Rn + R2i<tI(X 1 ,X 2 ;Y R ), (45c) 

where Xi and Y", (i=l, 2, R) are the input and output symbols of the channel at the terminal and 

relay nodes, respectively. 

In the two-way parallel Gaussian relay channel, the channel input and output symbols are 

given by the vectors Xi = . . . ,X iN ) and Y~j = (Y^, . . . ,Y iN ), respectively. The mutual 

information terms in (45a)-(45c) can be maximized simultaneously with the following channel 

input distributions [22, Section 9.4]: 
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Fig. 8. Achievable rate performance comparison of two-way OFDM relay strategies, where SNRi = SNR2 = SNR, R12 = R21, 
"RA" stands for resource allocation, and "equal PA" stands for equal power allocation in the legend. 



1) The elements of channel input X in should be statistically independent for different n; 

2) The elements of channel input X in should be Gaussian random variables with zero mean 
and unit variance. 

By applying these channel input distributions, and by further considering the power and channel 
resource constraints, the achievable rate region (5) is attained. 

Appendix B 
Proof of Proposition 1 

The optimal primal variables (Pi,p$) and the optimal dual variables (A*, a*) to problem (11) 
must satisfy the KKT condition 

^ = K + \* 2 + K-l = 0, (46) 
and the complementary slackness conditions 

K [Ka - r k (p*i,P$\ = 0, k = 1, 2, 3. (47) 
By (46), the optimal dual variable A* has at most two independent variables, i.e., = 1 — A£ — A3. 
For convenience, /^(p^p^) i s simply denoted as r£ for k — 1, 2, 3. If r* 7^ then by the 
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complementary slackness conditions in (47), the optimal dual variable A* must satisfy either 
A* = (1— A3, 0, A3) with A2 = or A* = (0, 1-A3, A3) with A* = 0, and the asserted statement 
is thus proved. Therefore, we only need to consider the case of r\ = r\. 
It can be easily seen from (13a)-(13c) that 

N 

r\ + pr\ = t £ log 2 (l + 9lnPtn I 92nPln + 9ln92 f A ) > (p + l)r* (48) 

n=l 

and the equality holds in (48) if and only if vXrjAn = f° r n = 1, • • • , -AT. This leads to two 
cases to be discussed as follows: 
Case 1: r* + pr\ > (p + 1)7-3. 

Since r\ = r£, we have that r* A < -^r\ + -^i r \ — f\ — t\. If A* > and \ 2 > 0, then the 
complementary slackness conditions in (47) imply R MA = r\ = r 2 > rjj, which contradicts with 
the rate constraint R MA < 7-3. Therefore, A^ and X 2 can not be both positive, and Proposition 1 
is thus proved in Case 1. 

Case 2: r\ + pr\ = (p + l)r£. 

Since r\ = r 2 , we have R MA = r\ = r 2 = 7*3. If problem (17) has an optimal dual solution 
(A*, ot*) with A^Aj = 0, the optimal dual variable A* already satisfies either A* = (1— A3, 0, A3) 
or A* = (0,1— A3, A3). Suppose that there is an optimal dual point A* satisfying A* > and 
A2 > 0, we will construct another optimal dual solution with the desired structure stated in 
Proposition 1. 

By (48), r\+pr 2 = (p+l)r% happens only if the optimal primal solution satisfies p\ n p 2n = f° r 
all n. Let us define I\ C J\f = {1, . . . , N} as the index set of subcarriers with p\ n > Q,p 2n = 0' 
and X 2 C J\f with p\ n = Q,p 2n > 0- The optimal primal variables (p*, p 2 ) and the optimal dual 
variables (A*, at*) to problem (11) must satisfy the following KKT conditions: 

dL MA 



Al + ^ + A3 - 1 = , (49a) 

, nelu (49b) 



<9i?MA 

dL MA , tg ln [\* + (p + 1)X{] I > 0, if vXn = 



dpin (P + 1) it + ginPln) ln 2 1=0, if p\ n > 

din* * tg 2n [p\t + (p+l)\* 2 } f>0, ifP2n = 

~» = a 2 7 — , 1W , , * vi o \ ' neJ 2, (49c) 

<?P2n p(p + 1) (t + # 2 n?4J ln 2 1=0, if f 2n > 

A* > 0, k = 1,2,3, (49d) 
KA~r* k <0, fc = l,2,3, (49e) 
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K(Ka-4)=0, fc = l,2,3, (49f) 
<>0, z = l,2, (49g) 
En=iP*n-^<0, i = l,2, (49h) 

a?(EjLiPS»-J o 0=°. < = 1 .?: _ _ i 49i ) 

If X* >^A£ > 0, we define a new dual point A = (AJ-±A£, 0, A^+^A^). Since Ai+A 2 +A 3 = 

A^+A^+A^, A 3 +(p+l)Ai = A*+(p+l)A^, pA 3 +(p+l)A 2 = pA^+(p+l)A$, and R^ A = r* = 
r\ = r 3 , the dual point (A, a*) and the primal point {p\,p%) also satisfy the KKT con- 
ditions (49a)-(49i). Therefore, A is an optimal dual solution of problem (17) that satisfies 
A=(1-A 3 ,0,A 3 ). 

If < A* <-i-A2, similarly we can define another dual point A = (0, X* 2 — pA*, A3+(p+l)A*) . 
Since A!+A 2 +A 3 = A^+A^+A^, A 3 + (p+l)Ax = X\+(p+l)X\, pA 3 + (p+l)A 2 = pX* 3 +(p+l)X*, 
and i?MA = r i = r 2 = r 3' tne dual point (A, a*) and the primal point (p\,p%) also satisfy the 
KKT conditions (49a)-(49i). Therefore, A is an optimal dual solution of problem (17) that satisfies 
A = (0, 1— A3, A 3 ). Hence, the statement of Proposition 1 has been proved for Case 2. 

Appendix C 

Closed-form Solution to (19) without Using Proposition 1 
(Discussed in Remark 1) 

When the structural property in Proposition 1 is not available, the primal power allocation 
solution is more complicated for the case of p* n > 0, p 2n > 0. In this case, the KKT conditions 
(19a) and (19b) both hold with equality. Therefore, we need to solve a system of quadratic 
equations with two variables. To simplify this problem, we define an auxiliary variable 

X = QlnPln + 92nV\ n - ( 50 ) 

Then, by (19) and through some derivations, we obtain 

D * = t(p + l)X 1 t_ 

Pln ai (p+l)\n2-tg ln X 3 /(t + x) g ln ' K > 

r>* = *(p + Wp t 

F2n a 2 (p+ l)ln2-tg 2n X 3 /(t + x) g 2n 
By substituting (51a) and (51b) into (50), we end up with the following cubic equation of x: 

tjhn(p_± l)Ai + tg 2n (p+ l)A 2 /p = x + 2t (52) 

a 1 (p + l)ki2-tg ln \ 3 /(t + x) a 2 (p + l)ln2-tg 2n X 3 / (t + x) 

It is widely known that the closed-form solutions of the cubic equation x 3 + ax 2 + bx + c = 
are given by Cardano's formula [39], i.e., 
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J6i 



q/2 



q/2 



a/3, 



x 2 = cue 



3O1/3 3 



x 3 =u> 2 e>Wt 



-q/2 + VA\+0J 2 e^ 3 ? 

,382/3 3 



g/2 - V A - a/3 



q/2 + y/A\ 



(53a) 
(53b) 
(53c) 



-g/2 - VA| - a/3, 

where p = -a 2 /3 + 6, g = 2a 3 /27 - a6/3 + c, w = -1/2 + j\/3/2, A = p 3 /27 + g 2 /4, 0j = 
angle {—q/2 + vA), 6> 2 = angle {—q/2 — y/~K) , and angle (•) denotes the phase angle of an 
complex number. If A > 0, the cubic equation has one real root and a pair of conjugate complex 
roots; if A < 0, the cubic equation has three real roots. 

After obtaining the positive real root x of (52), we can easily obtain the optimal p\ n and p\ n 
by substituting x into (51), which is the closed-form power allocation solution. 



Appendix D 
Bounds for Power Dual Variables 

The optimal a*(A) = (a^, ol%) for problem (24) must satisfy the KKT conditions (19), and 
there must exist ri\ (l<n 1 <A r ) such that p\ ni > 0. Thus, by the feasible conditions p* n > 
{i=l, 2, 71=1, . . . , N) and (19a) with n = n 1 , we can obtain an upper bound for a\, 



a 



tg lni \ 3 /{p + l] 



{t + giniPim + 92mP2ni) ln 2 (* + 9lmP* ni ) ln 2 
g lm [{p + l)Ai + A 3 ] ^ {p + l)Ai + A 3 



< 



< 



max{g ln }, 



(p + l)ln2 - (p+l)ln2 
and a trivial lower bound o>\ > 0. 

Similar discussions can be applied for ol\ and the optimal ol\{\§) for problem (35), and thus 
we can attain that 



n <? ^ (p + l)A! + A 3 



= a2,min < at < a 2 -_ 



{p+ l)ln2 
(p+ 1)A 2 + A 3 



max{#2n}, 



= a 3imin < ag < a 3 



,max — max 

n 



(p+l)m2 

P^2n(l-A 5 )+^lnA 5 



pln2 



(54a) 
(54b) 
(54c) 



By (54a) and (54b), we can initialize a and the matrix A in Algorithm 1 as 



a = (ai,max/2,ai >max /2), A 



a 



l,max 



/4 







a2,max/4 

and the initialization of ct 3 min and «3 ]max in Algorithm 3 is given by (54c). 



(55) 
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Appendix E 
Proof of Proposition 2 

By (48), it is known that n + pr 2 > (p + l)r 3 . This leads to two cases to be discussed: 
Case 1: r\ + pr 2 > (p + l)r 3 . 

In this case, if r 3 > n, we have p(r 2 — r 3 ) > r 3 — n > 0. Hence, r 2 > r 3 . Assume A* = 
(0, l-Ag, A 3 ) G A 2 \{A }, which means < A 3 < 1, and then we have 

(l-A*)(r 3 -r 2 ) <0. (56) 
On the other hand, since A* = (0, 1— A3, A3) is an optimal dual point, by (28) and (30), it must 
be true that 

(A* - A°) T £(A°) = (1 - A*)(r 3 - r 2 ) > 0, (57) 
which leads to a contradiction with (56). Thus, A* A 2 \{A }. By Proposition 1, we must have 
A*GAi. 

Similarly, if r 3 > r 2 , we can show that A* G A 2 . 
Case 2: r x + pr 2 — (p + l)r 3 . 

If only one of the inequalities of r 3 > ri and r 3 > r 2 is satisfied, similar to Case 1, we can 
show that A* G Ai if r 3 > ri and A* G A 2 if r 3 > r 2 . 

If both r 3 > ri and r 3 > r 2 , we have r x = r 2 = r 3 = i?J^ A by the condition of Case 2. Thus, 
according to (28), the subgradient £(A°) = 0. Substituting this into (29) yields 

Gma(A) < G M a(A°), V A G A a U A 2 , (58) 
which means that A itself is an optimal solution to (27). i.e., A* = A . 

Appendix F 
Proof of Proposition 3 

For sufficiently small x, since pj n < Pi = xPi, pi n is of the order 0(x b ) for b > 1, and pf n is 

of the order 0(x 2b ). By this and (41), each rate pair (i?i 2 ,-R 2 i) G 1Zdf(xP, Q) satisfies 

If* * 1 

J]^^^ + 0(x 26 ), (59a) 

^ n=l n=l ^ 

If* * 1 

-R21 < — -min< y^g 2n .p27i, ^InPftif+O^), (59b) 

71=1 71 = 1 

1 ^ 

R l2 + R 2l < — V (# ln p ln + # 2n ,p 2n ) + 0(x 2b ). (59c) 
In 2 z — ' 



n=l 
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By taking the summation of (59a) and (59b), it is easy to see that the sum-rate constraint (59c) 
always holds if (59a) and (59b) are true. In other words, the achievable rate region TZ DF (xP, Q) 
can be expressed by (59a) and (59b) for sufficiently small x. On the other hand, the cut-set outer 
bound region 1Z out (xP,<3) is also described by (59a) and (59b). Taking resource allocation into 
account, both 1Z DP (xP, Q) and lZ out (xP, Q) are in the form of 



for sufficiently small x. Therefore, for any rate pair (i? 12 , R21) £ 'R-om.i.xP, G), there exists some 
(■R12, R21) e n DF (xP, G) such that R 12 = R 12 + 0(x 2b ) and R 21 = R 21 + 0(x 2b ) for b > 1. Now 
let x — > 0, the rate regions 7?. DF (xP, Q) and 1Z out (xP, Q) tend to be the same. 

Appendix G 
Proof of Proposition 4 

First, we consider an equal power allocation scheme p in = Pi/N = xPi/N, which leads to a 
rate region given by 




(60) 




71=1 

N 




n=l 




N 



71=1 




A' 



Since this equal power allocation scheme is feasible, the rate region (61) satisfies 
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n^xP^g) cn DF (xP,g). 



(62) 



All the rate functions in (61) are of the form log 2 (l + ax) for a > 0. According to (42), it can 
be expressed by log 2 (x) + 0(1) for sufficiently large x. By this, for sufficiently large x, we can 
obtain that 



n^xP.g) = I {Ri2,R 2 i)e. 



N 



J Rl2<^tl0g 2 (x)+0(l), 

n=l 

N 

R 21 <J2t\og 2 (x) + 0(l), 

n=l 

N 

R12 + R21 < ^tlog 2 (x) +0(1), 

n=l 

N 

Ri2<^(l-t) log 2 (a;) + 0(l), 

n=l 

N 

R21 < 5^(1 - t) log 2 (x) + 0(1), < t < 1 \. (63) 



n=l 



Hence, similar to the definition (43), the multiplexing gain region of TZi(xP, Q) is given by 



{(ria, 



(64) 



r 12 < tN, r 21 < tN, r 12 + r 21 < tN, 
ris < (1 - t)iV, r 21 < (1 - t)iV, < t < 1, r 12 , r 21 > o}. 

After some simple manipulations, (64) can be simplified as (44). 

Then, we consider an infeasible power allocation scheme pi n = Pi = xPi, which results in 
the following rate region: 



n 2 (xP,g) = {(R 12 ,R 21 )e. 



N 

#l 2 <X> l0 g 2 (l- 



9lnXPl 



n=l 

N 



R2i<J2 tlo ^{ 1 - 



g2nXP 2 



n=l 



N 

Rl2 + R21 < J]] 



g Xn xP x + g 2n xP 2 



n=l 



t 



N 

i? 12 <5>-t)io g2 (i+^ 



n=l 

N 



t 



^i<53(l-t)log 2 (l+^^),0<t<l[. (65) 



n=l 



It can be easily seen from (5) that the rate region (65) satisfies 
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n DF (xP,g) c n 2 (xP,g), (66) 

and the multiplexing gain region of 7£ 2 (xP, Q) is also given by (64), and thus (44). Therefore, 
the asserted statement follows from (62) and (66). 
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